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Abstract 

In this paper, a point-to-point Orthogonal Frequency Division Muhiplexing (OFDM) system with 
a decode-and-forward (DF) relay is considered. The transmission consists of two hops. The source 
transmits in the first hop, and the relay transmits in the second hop. Each hop occupies one time slot. 
The relay is half-duplex, and capable of decoding the message on a particular subcarrier in one time slot, 
and re-encoding and forwarding it on a different subcarrier in the next time slot. Thus each message is 
transmitted on a pair of subcarriers in two hops. It is assumed that the destination is capable of combining 
the signals from the source and the relay pertaining to the same message. The goal is to maximize the 
weighted sum rate of the system by jointly optimizing subcarrier pairing and power allocation on each 
subcarrier in each hop. The weighting of the rates is to take into account the fact that different subcarriers 
may carry signals for different services. Both total and individual power constraints for the source and 
the relay are investigated. For the situations where the relay does not transmit on some subcarriers 
because doing so does not improve the weighted sum rate, we further allow the source to transmit new 
messages on these idle subcarriers. To the best of our knowledge, such a joint optimization inclusive 
of the destination combining has not been discussed in the literature. The problem is first formulated 
as a mixed integer programming problem. It is then transformed to a convex optimization problem by 
continuous relaxation, and solved in the dual domain. Based on the optimization results, algorithms 
to achieve feasible solutions are also proposed. Simulation results show that the proposed algorithms 
almost achieve the optimal weighted sum rate, and outperform the existing methods in various channel 
conditions. 
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OFDM, decode-and-forward relay, power allocation, subcarrier pairing, optimization, continuous 
relaxation, Lagrange dual problem. 

I. Introduction 

For an Orthogonal Frequency Division Multiplexing (OFDM) system with relay, identifying a proper 
way to allocate resources to the source and the relay is the main bottleneck for achieving good per- 
formance. In this paper, we consider a point-to-point OFDM system with a decode-and-forward (DF) 
half-duplex relay. Each message is transmitted in two hops each occupying one time slot. A message 
transmitted by the source on one subcarrier in the first time slot is, if successfully decoded by the relay, 
forwarded by the relay to the destination on one (not necessarily the same) subcarrier in the second time 
slot. With the assumption that the channel state information (CSI) is known at the source, many works 
have been done to make resource utilization of this system more efficient. 

A general downlink Orthogonal Frequency Division Multiple Access (OFDMA) relay system with 
individual power constraints at one source and many relays was considered in HI. In that work, joint 
optimization of the subcarrier selection and power allocation was done. However, that work assumed that 
a message is received by a destination either directly from the source, or from a relay which forwarded 
the message. Destination combining of the signals directly from the source and forwarded by the relay 
pertaining to the same message was not considered. In addition, as each relay collectively uses its active 
subcarriers to forward messages to different destinations, a more complicated re-encoding scheme has to 
be used by the relay to fit the received message for a particular destination into the subcarriers designated 
to that destination. In ||3-||4l, optimal power allocation for OFDM with DF relaying and fixed source 
and relay subcarrier pairing was proposed. 121 H considered two kinds of power constraints: one is 
that the total transmit power is shared between the source and the relay; the other has individual power 
constraints for the source and the relay. In ||5l-||7l, both power allocation and subcarrier pairing were 
considered for OFDM systems with relaying under the total power constraint. However, power allocation 
and subcarrier pairing were optimized separately. IS proposed a subcarrier pairing method by sorting 
the subcarriers of the source -relay (SR) link and the relay-destination (RD) link, respectively, according 
to their channel gains. The SR subcarrier and the RD subcarrier with the same respective ranks are 
then paired together. The optimality of this sorted channel pairing (SCP) scheme, in the absence of the 
source-destination (SD) link, for both DF and amplify-and-forward (AF) relaying schemes were proved 
in in Q. SCP was also proposed in |[8l- |[TT1l for OFDM AF relaying systems without the SD link, and in 
ifTll when the SD link and destination combining are present. Power allocation with total and individual 
power constraints for OFDM AF relaying systems were considered in ifTOl and |[T2l . while Q focused 
on only the total power constraint. The above works dealing with power allocation for the OFDM AF 
relaying systems usually used approximations to relax the problem into a solvable one. Without making 



DRAFT 



3 



any approximations, |[T3l investigated the optimal power allocation problem for the OFDM AF relaying 
systems with fixed subcarrier pairing and total power constraint in the absence of the SD link. 

In view of the lack of joint optimization of power allocation and subcarrier pairing for OFDM systems 
with DF relaying in the literature, the goal of this paper is to solve this problem with the presence 
of the SD link and destination combining of signals from the source and the relay. Both the total 
power constrained system and the individual power constrained system are considered. For the total 
power constrained system, we formulate the joint power allocation and subcarrier pairing problem as a 
mixed integer programming problem whose optimal solution is hard to obtain. We then use some special 
properties of the system and the continuous relaxation HI |[T4l to reform the problem and solve the 
dual problem by the subgradient method ifTSl . With both the power and subcarrier pairing constraints, 
the optimization problem becomes very complicated, and the duality gap may not be zero. However, 
as verified by |[T6l ifTTl and our own simulation, the duality gap is virtually zero when the number of 
subcarriers is reasonably large. Thus the dual optimum value becomes a very tight upper bound for the 
primal optimum for most practical systems. In addition to the duality gap, some other practical issues such 
as algorithm design and complexity comparison are also discussed. We then extend the formulation to 
have individual power constraints, and find that the complications caused by individual power constraints 
can be alleviated in the dual domain. The dual optimum value is again a very tight upper bound for the 
primal optimum. 

Finally, we relax the constraint that only the relay can transmit in the second time slot. Therefore, 
additional messages may be transmitted on the idle subcarriers in the SD link in the second time slot, 
when it is deemed that relaying on these subcarriers does not improve the weighted sum rate. Such a 
model was also considered in [4]. However, |@1 optimized power allocation (and relaying modes) only 
for a particular subcarrier pairing scheme without weighting of the rates. These conditions made the 
problem easier to solve. In this paper, we consider joint optimization of power allocation and subcarrier 
pairing with weighted rates. The problem is more general and difficult. However, by defining an additional 
indicator, we can formulate the problem similarly as in the case without the second-slot SD transmission. 
The problem is then solved in the dual domain. Simulation shows that, for this problem, the duality gap 
is also nearly zero. 

Based on the optimization results, algorithms to achieve feasible subcarrier pairing and power allocation 
are also proposed. Simulation results show that the proposed algorithms almost achieve the optimal 
weighted sum rate, and outperform the SCP proposed in Q in various channel conditions. 

The rest of this paper is organized as follows. Section HIl describes the system model. Section HID solves 
the optimization problem under the total power constraint. Detailed discussions on the practical issues 
are also presented in this section. Section |IV] solves the optimization problem under the individual power 
constraints. Section |V] formulates and solves the optimization problem for the system with additional 
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messages transmitted on the SD link in the second time slot, under both total and individual power 
constraints. Section |Vl] summarizes our results and observations. Section IVIII concludes this paper. 

II. System Model 

We consider a two-hop DF relay system consisting of one source, one relay, and one destination. 
OFDM with the same spectral occupancy is used for all links. The total frequency band is divided into 
M subcarriers. To avoid interference, for each subcarrier, only one node (the source or the relay) transmits 
in a given time slot. All time slots are of the same duration. The source transmits in the first time slot 
while the relay and the destination receive. The relay is half-duplex that receives in the first time slot and 
transmits in the second time slot. Each subcarrier used by the source in the first time slot is paired with 
one subcarrier used by the relay in the second time slot to convey a message. Therefore the number of 
subcarrier pairs in transmission is M. If subcarrier k in the first time slot and subcarrier m in the second 
time slot are paired, we call them subcarrier pair (SP) {k,m). It is assumed that the relay re-encodes the 
received message with the same codebook as the one used by the source. The destination maximum ratio 
combines (MRC) the signals from the source in the first time slot and from the relay in the second time 
slot pertaining to the same message to exploit the spatial diversity. The messages transmitted on different 
SPs are assumed to be independent. 

The channel model associated with SP {k,m) is shown in Fig.[T] We use hf^, hf^, and /if,^ to denote the 
channel gains of the SD link, SR link, and RD link on subcarriers k, k, and m, respectively. Og^ j,, a^^ ^, 
and <5\j) ,„ are the variances of the additive white Gaussian noises (AWGN) in the corresponding channels. 

we use a, = -4-^, ar = M^, and a„, = to denote the normalized channel 
gains. The channels are assumed to remain constant in a two-slot period. All the normalized channel 
gains are assumed known at the source which will perform subcarrier pairing and power allocation. The 
source then informs the relay and the destination of the corresponding parameters via proper control 
signaling before the data transmission. These assumptions are reasonable for the situations where the 
channel coherence time is longer than the sum of the CSI measurement and feedback time, the control 
signaling time, and the data transmission duration. 

In practical implementation, the channel gains can be measured at the relay and the destination during 
the training period preceding the data transmission period. The training period has a similar structure as 
the data transmission period in which the source transmits training signals during the first time slot while 
the relay and the destination measure the SR and SD channels, respectively. The relay then transmits 
training signals in the second time slot to let the destination measure the RD channel. A training slot could 
be shorter than a data transmission slot. The measured channel gains can be fed back to the source on 
dedicated reverse control channels. After the source has done subcarrier pairing and power allocation, it 
can embed the pairing and power allocation parameters in the beginning of the first-slot data transmission. 
This embedded control signal is transmitted with stronger power and/or more reliable coding. So it can 
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be guaranteed that the relay and destination can successfully decode the relevant parameters to figure out 
how to receive (and for the relay, how to forward as well) the upcoming data. 

Taking the same assumption as in 13, fSl, S-lTl, in Section Hill and Section [TVl we first consider the 
scenario where for each SP {k,m), the source only transmits in the first time slot. Even if it is decided 
that the relay will not transmit on subcarrier m, the source is not allowed to use this idle subcarrier in the 
second time slot. In Section |Vl this restriction is relaxed and the source is allowed to transmit additional 
messages in the second time slot on the subcarriers not used by the relay. This model has also been 
investigated in |4| which assumed fixed subcarrier pairing with SPs {k,k),k= 1,2, .. . ,M. Together with 
unweighted rates, the {k,k) subcarrier pairing makes determination of whether the relay will be active 
for SP {k,k) and optimal power allocation among the SPs easier to solve. However, it is inferior and less 
general than the joint optimization of subcarrier pairing and power allocation considered in Section |Vl 

For the sake of generality, we consider weighted sum rate as the performance metric. A weighting 
factor Wi > is assigned to the rate transmitted by the source on subcarrier k to reflect different priorities 
or quality-of-service (QoS) requirements. 

III. Weighted Sum Rate Maximization under Total Power Constraint 

In this section, we consider joint optimization of subcarrier pairing and power allocation to achieve 
the highest weighted sum rate under the total power constraint. We first give the problem formulation. 
Then a solution in the dual domain is given. The duality gap and achievability of the optimal solution, 
together with some practical algorithm design issues, will be discussed. 

A. Primal Problem Formulation 

For a given SP {k,m), let Rk^m be its achievable weighted rate, and pf^ and pf^ be the source power 
in the first time slot and the relay power in the second time slot, respectively. Depending on whether the 
relay is active, this SP may work in either the relay mode or the direct-link mode. In the relay mode, the 
half-duplex relay forwards the message on subcarrier m in the second time slot. In the direct-link mode, 
the relay does not forward, and only subcarrier k of the SD link in the first time slot is used to transmit 
the message. Thus the weighted rate achievable with Gaussian codebooks for SP {k,m) can be expressed 

as m 

f y log( 1 + af^pf j, direct-link mode 

Rk,m = \ w. (1) 

[ y min {log (1 + afpfj , log (l + pf,,, + /^f „,) } , relay mode 

where the rate is scaled by ^ because the transmission takes two time slots. 

Under the total power constraint of pk^m = plm^ Pfm ^'^^ {k,m), using relay is advantageous 

in terms of maximizing the achievable rate when 121 

af >af and C > <• (2) 
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In addition, based on the fact that, for the relay mode, the achievable rate is maximized when the amounts 
of received information at the relay and the destination are the same, the expressions in ([U can be unified 
as 11 



Rk,m = ^\og{\ +ak^„,pk^, 



(3) 



This is obtained by letting 



Pk., 



pin 



^ Pk,im 



^SD 



Pk,m, relay mode 

direct-link mode 
picm, relay mode 

^ 0, direct-link mode 



a 



SR 



-,SD 



(4) 



af^ + a 



RD . 



^SD 



in ([U, and defining at.m as the equivalent channel gain given by 



SR„RD 



ay; a 



k ^"-m -'^'k 
SD 



SD ' 



relay mode 



(5) 



, direct-link mode. 

Thus, when the channel gains are known, for any possible pairing, whether a SP {k,m) should be in the 
relay mode or the direct-link mode, and the maximum achievable weighted rate of this SP as a function 
of the total power p^. ^,, can be derived immediately. Define an indicator tk^,„ which is 1 if SP {k,m) is 
selected, and otherwise. The weighted sum rate optimization problem can be formulated as 

M M 



max ^ £ 

'''' k=lm=l 
M M 



Wk 



log(l + 



k=\_m=i 
M 

^tk,m = 1, Vm, 



k=\ 
M 



1, \/k, 



m=l 

Pk,m > 0, \/k,m, 
h.m G {0, 1}, yk,m, 



(6) 
(7) 
(8) 

(9) 

(10) 
(11) 



where P is the total power constraint, p S 111+ ^''^ (with 11+ denoting the set of nonnegative real numbers) 
and t G {0,1}*^^*^ are matrices with entries pk^m and f<.„„ respectively. Since the power allocated to 
the unselected SPs does not contribute to the weighted sum rate, it is obvious that the optimal solution 
wiU only allocate non-zero power to the selected SPs. Although similar in the approach, there are some 
significant differences between the above problem formulation and the ones in |T| and lITTI . ||T1 and iflTI 



DRAFT 



7 



both did not consider the SD Unk and destination combining when the relay is used. In ifTTl . the power 
allocated to each subcarrier is fixed. As mentioned in Section IH the relays in |[T1 have to use complicated 
re-encoders with codebooks different from that of the source. These differences make our optimization 
problem distinct from HI and ifTTj . 

The above problem is a mixed integer programming (MIP) problem which is hard to solve. Therefore, 
as in |[T4l ifTOll . we relax the integer constraint of (fTTI) as G IR+,V^,m. This continuous relaxation 
makes tt^m the time sharing factor of each SP. The relaxed problem then becomes 

max -yytk,„,Wklog[\+ai,,„^) s.t. (IT]),®,®,^!!, and (12) 

tk,m > 0, \/k,m. (13) 

Note that the value of the objective function (fT2]) is the same as that of the original objective function 
^ when tk_,„ e {0, l},V^,m. This objective function is concave because it is a nonnegative weighted sum 
of concave functions in the form of Jclog(l + ^) which is concave in {x,y) [14|. Since ([T2l ) is a standard 
convex programming problem, it can be solved by numerical search algorithms such as the interior-point 
method EOl . However, the optimal f|(.,m may not be integer-valued. Therefore, we opt to solve this problem 
by the dual method which can provide an upper bound for problem (fT2]) (by the weak duality |l20l). In 
Section UlI-BI it will be shown that the solution obtained by the dual method has G {0, l},V^,m. 

B. Dual Problem 

By duaUzing constraints Q and dUl, we obtain the Lagrangian as follows: 



\ ^ ( Pk \ 

^ i:=lm=l V 'k,m / 

/mm \ M / M 

+^ r ~ E E P't.'" + E h ~ E 



N (14) 

M M \ M / M \ 



k=\m=\ J m=l \ k=l 



where fj G 1R+ and a (the vector of a,„) E R are the dual variables, with R denoting the set of real 
numbers. The dual objective function is 



and the dual problem is 



h{iLi,a) = max L{p,t,iLi,a) s.t. @ , (HO]) , (HI (15) 
pt 



min h(^,a) s.t. ^>0. (16) 



It is well known that a function can be maximized by first maximizing over some of the variables, and 
then maximizing over the remaining ones ll20l Sec 4.1.3]. Thus we first solve pk,m for (031) by 

dL _ tk^mWk _Wk 1 

dpk,m~ 2 l+Gk,^!^ ^~ 2 + 



Pk„ ^ = ^ 1 , Pt.,„ -^" = (17) 
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with constraint (ITOl) . The optimal solution is 

Pk,m — hm 7. , (io) 

L ^k.m _ 

where x'^ = max{x,0}. This is similar to the result of multi-level water-filling |[T9l . We then rewrite ([141) 
as 

(19) 



k=lm=l 



where 



Wk. 



Xk.m = ^ log 1 



1^1 



1 



2iu 



M 



(20) 



(21) 



m=l 



We give an intuitive explanation for each term in Xj. The first term can be viewed as the rate obtained 
by selecting subcarrier m in the second time slot for subcarrier k in the first time slot, is the penalty 
of selecting subcarrier m in the second time slot. The last term is the price of power consumption. 

Due to the fact that K{fi, a) and Xf;,m are independent of t, we can easily find the optimal t for (fTSl) 
with constraints Q and ( [T3l) as 



'■k,in 



I, m = arg max „ 

fl7=l M ' 

0, otherwise 



(22) 



In operation, we first assume that /j and a„,'s are given. Then the power allocation for every possible SP 
can be computed by ([TSl l (with tk_,n ignored). These power allocation values are used in (l20l ) to compute 
Xi m's. After that, each subcarrier k in the first time slot will independently select the subcarrier in the 
second time slot that gives the largest „, to maximize the the dual objective function ([TSl l. 

The last step is to find the values of ju and a which minimize h{iu,a). Using the subgradient method 
|[T5]| . the values of fi and a can be found iteratively as 

/ MM 



i+l) 



(0 



(0 



a 



{'■+1) 



a. 



. k=\m=l 



1 



{'■) 




(23) 



m ■ 



1, 



k=l 



where the superscript (/) denotes the iteration index, and y^'^ and z^'^ are the sequences of step sizes 
designed properly. With the new /u and a in each iteration, the subcarrier pairing and power allocation 
can be updated with (l22l ) and ([TSl l. respectively, for the next iteration. As the number of iterations 
increases, ( [231) will converge to the dual optimum variables II15I . The optimal a, together with (l22l ). 
make f^,„'s satisfy ^ and 



DRAFT 



9 



Note that with the optimal power allocation given in (fTSl ). the achievable rate for SP {k,m) is 



1 



log 1 + ak, 



2^1 



1 



1 



■log 1 + Uk^^Wk 



1 

2^ 



1 



Wkak., 



(24) 



From (l24l ). the impact of the weighting factors can be viewed as weighting the channel gain of SP {k,m) 
by w^. A higher weighting factor results in more power allocated to the corresponding SP. 



C. Discussion on the Duality Gap 

For problem the optimal subcarrier pairing scheme may change as the total power constraint varies. 
Thus the maximum weighted sum rate as a function of the total power constraint may have discrete 
changes in the slope at the transition points where the optimal subcarrier pairing scheme changes. An 
example is shown in the circled region in Fig. |2] for M = 2 subcarriers. This phenomenon is similar to 
that observed in the optimal resource allocation for OFDMA downlink systems fTTl . However, in our 
case, this phenomenon is observed even when the weighting factors for all subcarriers are set to the 
same. As discussed in |[T6l lITTl . the nonconcavity shown in Fig. |2] may result in nonzero duality gap. 
Let us denote the optimal values of the original problem the relaxed problem ([T2l ). and the relaxed 
dual problem (fT6l) by Rg, Rr, and Dr, respectively. The relationship between them is Rb < Rr < Dr. 
Since the optimal f^,„'s found by solving ([TSl l and (fT6l) satisfy ([D, dUl and (fTTl) . we conclude that Dr is 
also the dual optimum value for problem Q. 

According to |[T6l |[2ll ifTTl . the duality gap is zero if the optimal value of the optimization problem 
is a concave function of the constraints. |[T6l and ifTTl also showed analytically and through simulations 
that the concavity will be satisfied as the number of subcarriers becomes large. In our case, we found 
that the concavity is mostly satisfied when the number of subcarriers is reasonably large. Specifically, 
when M = 2, we have observed in simulation that only about 1% of the possible channel reaUzations will 
result in the nonconcavity shown in Fig. [2] When M = 4, the probability of nonconcavity is about 0.4%. 
For M > 6, the maximum weighted sum rate is almost always concave in the total power constraint. An 
example is shown in Fig. |2]for M = 8 subcarriers. Thus, for practical OFDM systems, the duality gap is 
virtually zero, and Rg ^ Dr. We can then conclude that Rg^ Rr^ Dr for most practical OFDM systems. 
This will be verified by the simulation results in Section |Vll 

D. Algorithm Design 

Combining (l22l ). ([TSl l and (1231) . the algorithm to find the optimal subcarrier pairing and power allocation 
can be designed as in the upper part of Table H However, through simulation, we have observed that 
although (l22l ) guarantees that each row of t has only one "1", some of the "l"s may be on the same 
column. This corresponds to the situation where more than one source subcarriers select the same relay 
subcarrier. As a result, the constraint dH) is violated, and the solution is not feasible. This situation usually 
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arises in two scenarios. The first is when there are more than one source subcarriers with very strong 
SD gains, such that no matter which relay subcarrier they are paired with, the direct-link mode will be 
selected. For any of these source subcarriers, the power related terms in Xk „, (l20l) are the same for all 
relay subcarriers. Thus the relay subcarrier selection ((22)) depends only on a,„. The source subcarriers 
with this property will select the same relay subcarrier. The other scenario is when a source subcarrier 
gets a low equivalent SP channel gain a^ ^ no matter which relay subcarrier it is paired with. All the 
possible SPs formed by this source subcarrier will be allocated very little power, thus their X<,,„'s are 
dominated by the corresponding a„,'s. Similarly, the source subcarriers with this property will most likely 
select the same relay subcarrier. 

To handle this situation, we include an amendment algorithm in the original algorithm as shown in 
the lower part of Table Jl Based on the above discussion, the basic idea of the amendment algorithm is 
to each time move a "1" in a column of t with more than one "l"s to the column with no "1" that will 
cause the minimum change in the value of a,„. By moving a "1" to another column with a similar a„, 
value, the weighted sum rate will not be lowered much. When doing so, the amendment algorithm will 
make sure to keep the "1" corresponding to the largest X<.„, for each column with more than one "l"s. 
It will also move the redundant "l"s to the columns with no "1" that will result in as large values 
as possible. Thus the resultant weighted sum rate will be maximized. Eventually the pairing scheme t 
altered by the amendment algorithm will meet the constraints ^ and 

The amendment algorithm is triggered when the dual variables converge to a certain degree (for the 
example in Table Jl within 1%). Once the amendment algorithm is triggered, the algorithm will continue 
to run for another 10% of iterations. For example, if the amendment algorithm is triggered at the 1000th 
iteration, the algorithm will run another 100 iterations before it outputs the solution. For each of these 
10% of iterations, a feasible pairing scheme will be obtained by the amendment algorithm. Using this 
pairing scheme, regular water-filling over parallel channels will be applied to obtain the optimal power 
allocation and the corresponding weighted sum rate. The best pairing scheme and power allocation among 
these iterations that achieve the highest weighted sum rate will be the outputs of this algorithm. As shown 
in Section |Vll the weighted sum rate obtained by the algorithm in Table U is quite close to the optimal. 

E. Complexity Comparison 

The total number of all possible pairing schemes is 0{M\). With a fixed subcarrier pairing scheme, 
the complexity of computing the optimal power allocation ([TSl ) for the selected pairs is 0{M) in terms 
of multiplications. The complexity of computing the resulting weighted sum rate (weighted sum of Q) 
is also 0{M) in terms of log(-) operations and multiplications. Thus the complexity of exhaustive search 
is 0{M-M\) which is prohibitively high. 

On the other hand, in each iteration of the algorithm in Table H the complexity is dominated by 
the computation of Xii^i„,\/k,m, in (l20l ). That complexity is 0{M^) in terms of log(-) operations and 
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multiplications. For the amendment algorithm in the last 10% of iterations, alteration of the pairing 
scheme t takes only 0{M^) additions and max(-) and min(-) operations. The complexity of computing 
the optimal power allocation and the resulting weighted sum rate is 0{M) multiplications and log(-) 
operations. Therefore the overall complexity for the algorithm in Table U is 0{JM^), where J is the 
number of iterations. This complexity is much more feasible and tractable. 

IV. Weighted Sum Rate Maximization under Individual Power Constraints 

When the source and the relay have individual power constraints, the weighted sum rate maximization 
problem becomes 



Wk 

max — 

Ps,Pr,Ss,Sr 2 



^ log (1 (25) 



+ £ min{log(l+afMJ,log(l+af/.f,,„ + af;.f,,„)} (26) 

s.t. pI,„, pI„> 0,yk,m (27) 
E pL<Ps (28) 

{k,in)eSs'JSR 

I pL<Pr (29) 

{k,m)(ESR 

where and Pr are the source and the relay power constraints, respectively, and G M.^^^ and 
Pn G R^^^ are the matrices of pf^ and respectively. Ss and Sr denote the sets of SPs operating 
in the direct-link mode and the relay mode, respectively. If we let fj.,,, = 1 when {k,m) G 5sU5s and 
tk.m = Otherwise, 55 and Sr must satisfy the additional constraints (fS]) and 

This problem is very complicated. Because the condition to use relay depends not only on the channel 
condition, but also indirectly on the source power and relay power constraints 131 S, it is not possible 
to classify the SPs into the direct-link mode or the relay mode in advance to use the unified weighted 
rate formulation (O and the equivalent channel gain Q. In Section ITV- A[ we will first investigate optimal 
power allocation with fixed subcarrier pairing under individual power constraints considered in 121 |f3l| 
0. Through some insightful observations on the results of L3J, we will find that the unified weighted 
rate formulation ^ and the equivalent channel gain ^ can, in fact, be applied to the dual problem of 
(l25l) . After that, (1251 ) can be solved similarly as in the total power constrained case. 
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A. Unified Rate Formulation 

For (|25] ). assuming fixed subcarrier pairing, a Lagrangian similar to E eq. (8)] with slight changes 
can be obtained 

L= I ylog(l+«J 

{k,m)eSs 

+ E ylog(l+afpt« + «J 

\ k=l J V (A-.m)e5R / 

_L V n (nS'^r.S ^^OS „RD^R \ 

(/r,m)(£5fi 

where /v^ > and > denote the Lagrange multipliers corresponding to the source power constraint 
Ps and the relay power constraint Pr, respectively. The third Lagrange multiplier „, > corresponds to 
the condition 

4''plm><Pl„r+^fn"pln (31) 

for the SP {k,m) to operate in the relay mode S H. If (|3T]) is not valid for a relay mode SP, apparently 
some of that SP's relay power can be reallocated, without reducing its rate, to other relay subcarriers to 
improve their rates, or simply to be conserved. Following the same procedures as in 131, a SP {k,m) can 
be further classified into the following three modes given that /ug and jur are fixed: 

Direct-hnk mode : af" > af or < af— 

Relay mode : af^ > af^ and a'^^ > af— (32) 

Intermediate mode : af > af and a^ = af—, 

where the intermediate mode is a special case of the relay mode with the condition (|3TI ) satisfied with 
strict inequality (and the corresponding pj.„, = 0). That is, the relay receives more information than the 
destination. Thus the relay mode here is redefined to include only the SPs that satisfy (|3TI ) with equality. 
That is, the amounts of received information are the same at the relay and the destination. According to 
i3l . usually there is at most one SP in the intermediate mode. For both the relay mode and the intermediate 
mode SPs, the solutions that maximize the Lagrangian (l30b will make its last term zero. 

The first condition af > af for selecting the direct-link mode over the relay mode for SP {k,m) 
is based on the fact that, in this situation, the destination will receive more information than the relay. 
Then there is no need to use the relay. The second condition a^,^ < af^ ensures that the direct-link 
mode will contribute more to the Lagrangian (l30l ) than the relay mode. For example, for SP {k,m), if the 
power "cost" for selecting the direct-link mode, f^spfm' ( l30l ) is kept the same as the power cost for 
selecting the relay mode, IJsPkm~^/^RPkm' when a™ < afj^< selecting the direct-link mode will result 
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in a higher weighted rate (hence larger value of (l30l )) than selecting the relay mode. This is implied by 
the "single sum power constraint" approach in H eqs. (39) and (40)] with modified power and channel 
gain variables. 

By solving (l30l ) for weighted sum rate maximization, the power allocation can be obtained 



Direct-link mode : 



Relay mode : 



pI, 



pL 



Wk 



1 



Pk,n 







2{fJs + fiR/^k,m) a] 



1 

1r 



Pk,n 



Wk 



(33) 



Intermediate mode : According to 1 + af^pl ,„ + afj^pf ^^ 



Wkaf 



.RD 



2^s 



where 



and source and relay power constraints, 

^k.m 



-.RD 



-,SR 



(34) 

Power allocation for the intermediate mode SP can be computed after the power allocations for the 
direct-link mode and relay mode SPs are done. Note that since the relay mode SPs must satisfy (OTI ) 
with equality, it is clear that pf^^ = ^k.mPfm- Frorn ( [32l ) we know that for the relay mode, ^k.m > 0. Thus 
pf^j and must be zero or positive simultaneously. This can also be seen from the relay mode power 
allocation in ( [33l ). This observation allows us to allocate total power pk^^ = Pkm~^ Pfm ^'^ relay mode 
SPs first according to 



Pk,m — {?>k.,m + 1; 



Wk 



1 



(35) 



then obtain the corresponding pf^^ and pf^, using the relay mode power distribution in (01). To this end, 
the unified weighted rate expression in (|3]l with ^ and the equivalent channel gain ^ can be applied 
here as well to the direct-link mode and relay mode SPs, when ^5 and /ur are fixed. 

As to the intermediate mode SP, we examine its contributions to the rate and cost in the Lagrangian 
(l30l ) and find that, with ,„ = 131, they are 



rate = log (l + + pf,„,) 



Wk 



log 



( wk4^ 



cost = fisplm+ win 



.,RD 



S ^R R 
- Ms { Pk,m + —Pk,n 
i"5 



■ f^S Pk.m + -snPk.n 



SD Pk,n 



(36) 



^RD„R 



Pk,m) 



Wk 
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If the intermediate mode SP was classified as a direct-link mode SP, it would contribute to the Lagrangian 
^ with 

S f Wk I 

On the other hand, if it was classified as a relay mode SP, with Pk.m = "' aSR^Ro = 0> its contributions 
to the Lagrangian (l30l ) would be 

rate = ^ log (l + pf_„, + J^^pl,) 




■log 



2 V 2a^s 

y^k Pk.m^'^m Pk.in) — SD V^k Pk.r 
1 



(38) 



^k 



l{flS+f^R/^k,n 



-,SR 



Wk 1 

2a^5 



Interestingly, with given /ns and /^s, the intermediate mode SP's contributions to the Lagrangian (l30l) 
remain the same no matter it is classified to the direct-link mode or the relay mode. Thus, in terms of 
maximizing the Lagrangian, we can assign the intermediate mode SP to either mode without affecting 
the result. However, once the optimal jUs and /ur are obtained, we still need to identify the intermediate 
mode SP and allocate its powers according to (1331 ). 

In the following, we will assign the intermediate mode SP to the relay mode. Together with the 
conclusion that the unified weighted rate and equivalent channel gain expressions can be applied when 
fus and fiR are fixed, the dual problem of (l25l ) can be formulated with unified expressions. 
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B. Dual Problem 

By dualizing ([8]l, (|28] ). ( |29l ). ( |3T| ). letting f and p be the matrices of tk,m and Pk,m, respectively, and 
applying continuous relaxation to tk^m^ as in Section UlI-AI we have the following Lagrangian 



MM Wk ( Pk \ 

k=lm=\ ^ \ '^k,m / 

/mm \ 

+ ^^S\Ps-Y.lL ^k,m Pk,m 



k=l m=l 
M M 



\ k=lm=l J 

M / M \ 



m=\ \ k=l 



where 



_L V r. (^S'^r,^ n^DS „RD„R \ 
+ l_ Pk.m [at Pk,m - <^k Pkjn " «m PLm) i 

{k,m)eSR 



' when af > af and a"^ > af 



cik,? 



+«m -«yt ^^S (40) 



^k ' 

"n7 



otherwise 



when af > and > df^'^ 



c I ^,SR I „RD „SD ' " — " , , 

cl^= ^ % +a„^ -au fJs (41) 

1 , otherwise 

^,SR_„SD ,, 
when ^■^^ ^ n^'^ and > n^D^R 

-SR—Z^^—SD^ whena^ > and a„ ^ ^k — 
_ . +«m f^s (42) 

0, otherwise 

are the equivalent channel gain, the portions of pk^m distributed to source power and relay power, 

respectively, for the two modes specified in the conditions. Similar to ([T6l ). the dual problem associated 

with (|39l ) can be expressed as 

min h{iLis,iJR,a) s.t. fis>0, /ur>0 (43) 

with 

h{iJs,lJR,a) = max L{p,t,iJs,IJR,a,p) s.t. , (HO]) , (Hi]) . (44) 
p.'p 

Note that the source and relay power distribution (|4TI ). (l42l ) satisfy the constraint af^pf^ ~ ^¥'Pkm~^ 
'^nFPkm relay mode. Thus the last term in ( [39b is always zero. Following the same procedure as 

in Section IIII-B I and applying the results in Section IIV-AI the optimal power allocation for (l44l ) can be 
solved as 



Pk,m - fkj 



+ 

Wk 1 



(45) 
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The optimal tk^m can be solved as 



1, m 
0, otherwise 



arg max Z^,,„ 

ni=l M 



(46) 



where 



Wk 



-^k.m 



log 1 + ak^, 



Wk 



Wk 



ak,i 



(47) 



Wk 



1 



_2(cLa'5 + cLa'«) ak. 

Since assigning the intermediate mode SP to the relay mode does not change the dual value, we can 
approach the dual optimal value by the subgradient method. The Lagrange multipUers fis, Hr, and a are 
updated by 

/ MM ^ \ 

Ps-Y^Y^ Ck.mPlm ' 



('•+1) 



(/+i) 



(0 



a 



i+i) 



a: 



k=\ m=\ 
M M 



1 k=lm=\ 



(48) 




M 



k=\ 



''Km 



m ■ 



1, 



where yg\ jjj'' and z^'' are the sequences of step sizes designed properly. When the optimal subcarrier 
pairing and power allocation do not include an intermediate mode SP, (1481 ) will converge to the optimal 
values. However, when an intermediate mode SP is present in the optimal solution, ( [48] ) may oscillate 
around the optimal values. Specifically, due to assigning the intermediate mode SP to the relay mode 
with power allocation ( |45l ). the relay power for that SP is increased, while the source power is decreased, 
to make (|3T]) satisfied with equality instead of strict inequality. Thus, even when fis and /jr are already at 
their optimal, the total source power consumption will be smaller than the source power constraint, and 
the total relay power consumption will be larger than the relay power constraint. This will result in /ug 
decreased and ^xr increased in the next iteration. Then /ur/ius will be increased, and the intermediate mode 
SP may fall in the direct-link mode according to ([32]). Similarly, this will make Hr/^s decreased, and the 
intermediate mode SP may fall in the relay mode in the next iteration. As a result, ([48] ) oscillates. Similar 
oscillation was also observed in |[3l. Thus, like in |[3l Section 3.2], the zero-crossing of the difference 
between the total source power consumption and the source power constraint can be used to determine 
the optimal Hr/hs and the corresponding mode classification and power allocation. However, due to the 
issues discussed in Section IIII-D[ we have found that the optimal zero-crossing is very difficult to trace 
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when subcarrier pairing, mode classification and power allocation are updated at the same time. In the 
algorithm given in Table Ull similar to Table IH the amendment algorithm is used to obtain a feasible 
pairing scheme when the subgradient method converges to a ceratin degree. With diminishing step sizes, 
we found that the subgradient method will eventually be stuck at assigning the intermediate mode SP 
(if it exists in the optimal solution) to either the direct-link mode or the relay mode. In both cases, the 
obtained subcarrier pairing scheme is near optimal. With fixed subcarrier pairing, and iir/hs given by the 
amendment algorithm which is already very close to the optimal, the zero-crossing method in IS Section 
3.2] can be used to quickly obtain the optimal Hr/^s- Then the corresponding mode classification and 
power allocation can be done according to (l32l ) and (1331 ). respectively. 

The algorithm in Table Ull has the same order of complexity as that of the algorithm in Table Jl Through 
simulation, we have also found that the duality gap for this problem approaches zero when the number 
of subcarriers is reasonably large. 

V. Weighted Sum Rate Maximization with Extra Direct-Link Transmission 

In the previous sections, only the relay can transmit in the second time slot. Therefore, for the SPs 
operating in the direct-link mode, the second time slot is not used. It is possible to allow the source 
to transmit extra messages in the second time slot on these idle subcarriers. We consider this modified 
system with both total and individual power constraints. 

A. Total Power Constraint 

Under the total power constraint, the achievable weighted sum rate for SP {k,m) for this system is 
r !^iog (1 +afpi^) + ^log [l+afqi^) , direct-Unk mode 

Rk,m =1^^ (49) 

[ y min {log (1 + afp^^,) , log (l + af'pl,,, + pf,„J } , relay mode, 

where p^,„, and q^^ represent the source power in the first time slot, relay power in the second 
time slot, and source power in the second time slot, respectively. By comparing the achievable weighted 
rate for these two modes, we find that the condition for using the relay depends not only on the channel 
gains but also on the power allocation. Thus we introduce an additional indicator 5<. „, related to the use 
of the relay as a variable to be jointly optimized. When Sk,m = 1> the relay is used for SP {k,m). When 
Sk,in = 0, the relay is not used. In addition, we again make continuous relaxation for the indicators and 
the same adjustment to the sum rate function. The relaxed weighted sum rate maximization problem is 
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expressed as follows 



max 
p 



+ i^-Sk,m) 



Pk,m,\ 



1 I 1 I 5D Pk.m.2 



^^,m ( l-Sk ,m ) 



+w,„log 1 +a; 



^SD Pk,m,3 



^k.mi^^ ^kjTi) 



s.t. (III),®,!!!!]) 

M M 3 

ELL - ^ 

Pk,m,r > 0,yk,m,r 
< ^A-.m ^ l,yk,m, 



(50) 



(51) 

(52) 
(53) 



where PA,m,i and are the sum power and equivalent channel gain, respectively, of the relay mode SP 
{k,m) taking the form of the relay mode expressions in ^ and (|5]l. Pk.m.2 and Pk.m3 are the powers used 
by the direct-link mode SP {k,m) in the first and second time slots, respectively, p € R^^*'^'^, ? G 



)MxM 



and s e R+''^ are the matrices of pk,m,r, h.m, and Sk.m, respectively. P is the total power constraint. 

grangi 

Pk,m,Y 

t -Ttik,m 
WAl0g(l+flf 



Similarly, by dualizing constraints ([8]l and dSTT ). we obtain the Lagrangian as 

Y M M r / 

L{p,t,S,^l,a) =T L E ^''■"^ 1 ■^-t^mW^log ( 1 



SD Pk,m,2 



^k,in ( 
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(54) 
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where € 11+ and a G R are the dual variables. Then the dual objective function is computed as 

%,a) =max L(p,f,*,A/,a) s.t. Q , ([I3D , ([521) , (l53|) . 

p,t,s 



The dual problem is given as 



The solution to (1551) is 



min h{fi,a) s.t. /J > 0. 



(55) 
(56) 



Pk,mA ~ h.m^ky 
Pk.m,2 = tk,m{^ — ^km) 
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1, >flf andy^,„>y,^, 

0, otherwise, 
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are the SP {k,m)'s contribution to the Lagrangian when it is in the relay mode or the direct-link mode, 
respectively. The condition af^ > af^ in (1581 ) is necessary. The reason is that the value of Y^^ is 
meaningless when af^ < af^, since it is impossible to make the relay receive more information than 
the destination. The sl^^ tells us whether it is better to use relay for the SP {k,m). 
The SP selection variable is given as follows 



'■k.m 



I, m = arg max ,„ 

m=l,...,M ' 



(61) 



0, otherwise 
where 

Yk,m = ^k,m^k,m + ~ ^k,m)^k,m ~ ^m- (62) 

Again, the dual optimal value is reached by the subgradient method. The Lagrange multipliers a/ and a 
are updated by 



(/+i) 



{'■), 



M M 



a 




(63) 



m ■ 



where j''' and z^'' are the sequences of step sizes designed properly. 

The algorithm to obtain feasible solutions is given in Table HID where s^ f^'s found in an iteration are 
directly used, together with the subcarrier pairing scheme t obtained by the amendment algorithm, to 
compute the power allocation and weighted sum rate. Doing so is suboptimal, as s^,,, in fact depends 
on the power allocation. However, this saves the complexity involved in joint optimization of Sk,m and 
power allocation given fixed subcarrier pairing. The algorithm in Table |lll] also has the same order of 
complexity as that of the algorithm in Table Jl We have found that the duality gap for this problem is 
virtually zero when the number of subcarriers is reasonably large. 
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B. Individual Power Constraints 

With individual power constraints for the source and the relay, the problem can be solved by combining 
the results in Section Hvl and Section IV-AI with some crucial modifications. Due to limited space, we will 
discuss only these crucial points. 

As in Section |IVj in addition to the direct-link mode and the relay mode, there may also be an 
intermediate mode in which the relay receives more information than the destination, and (|3T]) is satisfied 
with strict inequality. On the other hand, the relay mode should satisfy (|3T]) with equality. Note that given 
the same power, the rate of the direct-link mode in (l49l ) with extra second-slot SD transmission should 
be no less than the rate of the direct-link mode in ([T]). This is because the latter is a special case of the 
former with the second time slot allocated zero power. Therefore, the necessary condition af^ > af^ in 
(l32l) for the relay to be active (in both the relay and the intermediate modes) is also necessary in this 
case. The second necessary condition a^ = af^^ in ([32]) for a SP to be in the intermediate mode, as 
derived in ||3l, is directly related to having (|3TI ) as a strict inequality. Thus it is also necessary in this case. 
The second condition af^^ < af^^ in (l32l) for selecting the direct-link mode over the relay mode was 
derived by comparing the achievable rates of the direct-link mode and the relay mode when they have the 
same power cost in the Lagragian (l30b (see the discussion after (l32l ) and HI). With the extra second-slot 
SD transmission that can improve the rate for the direct-link mode, this condition may change. In fact, 
with the extra second-slot SD transmission, the direct-link mode may possibly be selected even when 
fl^^ > af^^- For the special case with fixed {k,k) subcarrier paring, lH has derived the exact condition 
which also depends on the allocated power. In our case, the subcarrier paring is variable and may not be 
the trivial {k,k) pairing. Due to this reason and different weighting factors in the rate of the direct-link 
mode (|49l ). the exact condition based on having the same power cost is complicated and dependent also 
on the weighting factors. However, we may simplify the condition by comparing the contributions of the 
direct-link mode and the relay mode to the Lagrangian. This approach is similar to using (l58l ) to select 
modes to maximize the Lagrangian dSSl ). 

The fact that the direct-link mode may also be selected when a™ > flf^^ implies that the region for 
the intermediate mode to occur may be encompassed by the region for selecting the direct-link mode. 
That is, the intermediate mode may no longer exist, except in the special situation where the optimal 
power allocation for a direct-link mode SP results in zero power for the second-slot SD transmission. 
For a SP with this property, there will be no second-slot SD transmission if the direct-link mode is 
selected. Then the situation becomes the same as in Section |IVl Thus (l32l) can be used to select modes, 
and the unified rate formulation discussed in Section IIV-AI can be applied with the intermediate mode 
SP assigned to the relay mode. 

In summary, we can assume that there are only the direct-link mode and the relay mode, and apply 
the results in Section IV-AI with the following changes. The relay mode power allocation p\^i in (|57] ) 
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takes the expression of the right-hand side (RHS) of (l45l) multiplied by s^.m, where cf^ and c^,„ are the 
portions of pl^i distributed to source power and relay power defined by the relay mode expressions in 
(|4T]) and (|42]l, respectively. The RHS of is replaced by the RHS of (07]) with -a„, removed, fi in 
(l60l ) is replaced by ^5. In each iteration, /us, ihr and a are updated as in (l48l) using the power allocation 
computed in (|57] ) ipl„ji computed by the RHS of (1451 ) multiplied by si^ „j). If there is an intermediate 
mode SP in the optimal solution, it must belong to the situation where the conditions (l32l ) are applicable. 
Then, like in Section ITV-BI the zero-crossing method in 131 Section 3.2] can be used to obtain the optimal 
Hr/^is- The corresponding mode classification and power allocation can then be done accordingly. 

VI. Simulation Results 

This section provides the simulation results for the rates obtained by the proposed algorithms, and 
the dual optimum values which serve as the performance upper bounds. The performances with fixed 
subcarrier pairing and with the SCP proposed in 151 are also presented for comparison. Note that the 
original SCP in [51 considered only the unweighted sum rate. It first sorts the subcarriers of the SR link 
and the RD link, respectively, according to their normalized channel gains, then pairs the SR subcarrier 
with the RD subcarrier having the same rank. For weighted sum rate, according to (l24l ) and the discussion 
right after it, we modify the SCP such that Wkaf^ and af^f^ are sorted first. Then the SR subcarrier and 
the RD subcarrier with the same rank are paired. The RD link channel gains are not weighted for the 
reason that we do not know the actual subcarrier pairing scheme in advance. 

The channels of different links are assumed to be independent of one another. The channels of the 
subcarriers are independent and identically distributed (i.i.d.) Rician fading channels with ^-factor = 1. 
They are assumed constant within each two-slot period, and varying independently from one period to 
another. The AWGN variance is assumed to be one. The total power constraint is set as P = 5. As for 
the cases with individual power constraints, the source power constraint is = 4 and the relay power 
constraint is Ps = 1 . These constraints are set with the practical consideration that the relay usually plays 
the role of assisting the transmission and/or extending the coverage, and has a smaller power than the 
source. In addition, when the relay is allowed more power and the achievable rate becomes limited by 
the source power constraint, some of the relay power will not be used. Setting Ps = 4 and Pr = 1 reduces 
the occurrence of this situation and makes the comparison with the total power constrained case fairer. 
For all cases, the SD link is present, and the destination performs MRC whenever the relay is used. 
The SCP schemes first establish subcarrier pairing using SCP. Then, in the total power constrained cases 
(including the case with extra direct-link transmission), (|2]) is used as the condition to use relay. In the 
individual power constrained case without extra direct-hnk transmission, the method in 121 is used for 
mode classification and power allocation. For the individual power constrained case with extra direct- 
link transmission, the method in f3] cannot be used because the optimal mode classification conditions 
are no longer (l32l ). Naively using (l32l ) and (1331 ) may result in invalid power allocation as they are not 
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the solutions in this case, and will affect the fjg, fiR values through the iterations. On the other hand, 
modifying the method in H to accommodate weighted rates is tedious. Thus, the algorithm discussed in 
Section rV-B I is used with fixed subcarrier pairing from the SCR The fixed pairing schemes use the same 
mode classification and power allocation procedures as that of the SCP schemes. In Figs. |3l |5l [6l [H |9] 
and [TT] the unweighted sum rate is considered. That is, the all-one weighting factor is used. In Figs. |4l 
It] and [TOl the weighted sum rate is considered with Wk = 1 + j which is used only as an example 
with a concise expression. 

For the proposed algorithms, /u, /ur, /us and a„,'s were randomly initialized to be between and 2 for 
each two-slot period. For each number of subcarriers (e {4,8,16,32,64}), 1000 such two-slot periods 
were simulated, and the results averaged to avoid favoring certain initial conditions. The step sizes for 
the subgradient method were all set as ^ , where / is the iteration index. In the simulation, we observed 
that the number of iterations before the amendment algorithm was triggered depends on the number of 
subcarriers. The number of iterations needed ranged roughly from a few hundreds for small numbers of 
subcarriers (< 10) to slightly more than 10000 for 64 subcarriers. 

We investigate three system configurations corresponding to different scenarios. In Figs. 13 |4] and ID 
the mean square channel gains of the SR, SD and RD links are 3, 1, 3, respectively. This corresponds 
to the situation where the relay is placed between the source and the destination. In Figs. |6l [V] and [H 
the mean square channel gains of the SR, SD and RD links are 5, 1 and 1, respectively, which means 
that the relay is close to the source. In Figs. |9j [TO] and (TTJ the mean square channel gains of the SR, 
SD and RD links are 1, 1 and 5, respectively, which means that the relay is close to the destination. In 
these figures, we find that, in all cases, the rates obtained by the proposed algorithms are almost equal to 
the corresponding dual optimum values. This validates the arguments in Section ITlI-C[ Section ITV-BI and 
Section IV-AI that the duality gap is virtually zero when the number of subcarriers is reasonably large. 
Even when the number of subcarriers is 4, the duality gap is hardly noticeable from the averaged results, 
because it is zero with a very high probability. These results also show that the proposed algorithms 
can almost achieve the optimal weighted sum rates. There are some other general trends that can be 
observed from these figures. One of them is that fixed subcarrier pairing incurs a significant performance 
loss. In addition, the weighted and unweighted sum rates increase with the number of subcarriers due 
to frequency diversity and more flexibility in pairing. As to the performance under different constraints, 
the performance under total power constraint is better than the performance under individual power 
constraints, due to the flexibility in power allocation. By comparing Figs. [3] and \5\ [6] and [H |9] and [HI it 
is clear that extra direct-link transmission always improves the performance. 

The SCP was proved in |6l Q to be optimal for the unweighted system without the SD link under the 
total power constraint. When the SD link is present and/or when weighted sum rate is considered, the 
performance of the SCP depends on the link qualities. The SCP almost achieves the optimal unweighted 
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sum rate for the cases with total power constraint and no extra direct-link transmission in Fig. |3] and 
Fig. |9l but becomes noticeably worse than the optimal in Fig. [6l For the scenario in Fig. |3l this is 
reasonable because the SD link is relatively weak compared to the other two links. Thus the direct-Unk 
mode is rarely used, and the SCP is nearly optimal for the relay-mode SPs given that their SD subcarriers 
are weak. For the scenario in Fig. |9j the RD link is the strongest and seldom becomes the bottleneck for 
mode selection. For the SCP as well as the proposed algorithm, mode selection is mainly determined by 
the SR and SD links. For the direct-link mode SPs, the SCP and the proposed algorithm have similar 
performances. For the relay mode SPs, the SCP is nearly optimal because the SD link is the weakest 
among the three links. Overall, the SCP has a very similar performance to that of the proposed algorithm 
which is almost optimal. As to the case of Fig. [6l we can see that since the SR link is much stronger 
than the SD link, the condition for using relay ^ is dominated by the relation between the channel gains 
of the SD and the RD links. However, the SCP does not consider the SD link in establishing subcarrier 
pairing. As a result, the SCP is almost equivalent to random pairing in terms of optimizing the mode 
selection and sum rate. Thus its sum rate is smaller than that of the proposed algorithm. The SCP is still 
better than fixed pairing because it helps the SPs that are in the relay mode. 

For the individual power constrained cases, or when weighted sum rate is considered, as shown in 
Figs. [3l m and IH |7j the gaps between the SCP and the proposed algorithms become larger. This is due 
to the mismatches between the SCP and these scenarios. To show that our modification to the original 
SCP is meaningful, we show the performance of the original (unweighted) SCP together with that of 
the "weighted SCP" in Fig. |4] and Fig. |7j These two figures clearly show that the original SCP is not 
suitable when weighted sum rate is considered. The performance gap between the "weighted SCP" and 
the proposed algorithm in Fig. [7] is due to the aforementioned "random pairing" effect of the SCP (as in 
the total power constrained case in Fig. [6]l. However, these trends do not appear in Fig. |9] and Fig. [TOl 
For Fig. m this is because the strong RD link makes the sum rate not limited by the low relay power 
constraint. Therefore, for the SCP, the situation is very similar to that with the total power constraint. 
As a result, the SCP is almost optimal. For Fig. [TOl the strong RD link makes mode selection dependent 
almost only on the channel gains of the SR and SD links. Thus, mode selection is almost independent 
of the pairing scheme and weighting factors. For the source subcarriers that have relatively lower SR 
gains and are in the direct-link mode, all schemes yield similar performances. On the other hand, for the 
subcarriers in the relay mode, pairing better RD subcarriers with SR subcarriers having higher weighted 
channel gains can improve the weighted sum rate. Both the original SCP and the weighted SCP can do 
that for the SR subcarriers that are strong enough. Thus they both perform well and almost optimally. 

With possible extra direct-Unk transmission, the SCP is worse than the proposed algorithm for not 
considering the benefits of the extra direct-link transmission (such as more diversity from the additional 
independent channels, and more flexibility in water-filling) in subcarrier pairing and mode selection. Under 
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the total power constraint, we find tliat tlie SCP is similar and even slightly worse than fixed pairing in 
Fig. [8] and Fig. [TT] This is because, without considering the possible extra direct-link transmission, the 
SCP's pairing of strong SR subcarrier with strong RD subcarrier tends to satisfy Q more than fixed 
pairing, and make more SPs use the relay. Thus it loses the opportunities to transmit more messages 
with the extra direct-link. This phenomenon does not appear in Fig. |5l for which the benefits of the extra 
direct-link transmission are not significant due to the weak SD link. Under individual power constraints, 
both the SCP and the fixed pairing schemes use the algorithm in Section IV-BI for optimal joint mode 
selection and power allocation. The SCP always performs better than fixed pairing due to its better 
subcarrier pairing. In Fig. \TT\ the advantage of the SCP over fixed pairing is smaller than in Fig. |9] 
because the optimal mode selection assigns more SPs to the direct-link mode for which better SR-RD 
subcarrier pairing does not improve the rate. 

VII. Conclusion 

In this paper we investigated OFDM point to point transmission, enhanced with a DF relay. We jointly 
optimized subcarrier pairing and power allocation to maximize the weighted sum rate with consideration 
of the source-destination link and destination combining. To the best of our knowledge, this problem 
has not been solved before. Both total power constraint and individual power constraints for the source 
and the relay were considered. The system that allows additional messages to be transmitted on the 
idle subcarriers not used by the relay, in the source-destination link in the second time slot, was also 
investigated. We solved the optimization problems by using some special properties of the systems, as well 
the continuous relaxation and the dual method. The subgradient method was adopted to find the Lagrange 
multipliers which also helped us to find the primal feasible solutions. Based on the optimization results, 
algorithms with tractable complexities to obtain feasible subcarrier pairing schemes and the corresponding 
power allocations were proposed. Simulation results showed that the proposed algorithms can achieve 
nearly optimal weighted sum rates, and outperform the method proposed in Q under various channel 
conditions. 
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Fig. 1. Channel model for subcarrier pair {k,m). 
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TABLE I 

Algorithm for the total power constrained case. 

Initialize i = 1, /j''', a''', £ = 0.01, amendment = false, max_it = oo, sum_rate = 
While (i< max_it) (** iteration i **) 

Compute xl'l^,yk,m, using a'-'"' in l [20t 

Compute /('^ using in ||22|) 

Compute using ^u^'', in (fTSjl 

Compute iJ^'+^\ a('+^' using /y*'), a('\ m ([23} 

/ l„(i+i)_„(0| N / ||a''+''-o'''ll \ 
If (amendment = false) and y ' < £j '^"'^ \^io?^+n]| 

amendment = true 

max_it = [1.1 X ij 

End 

If (amendment = true) (** amendment algorithm **) 

For (j = I to M) 
If ('^V>1)' ■''* = a''gmax{,|?^_i}Xj'], End 
While (cj > 1) 

* ■ I (') (')l 

m* = argmin{„|c_ _o} «) - Km 

r* = argmax{^|?^_i^ r^s'}^!:'},. 

tf*,] — 0, tf^'^iji'' = 1 

e j ~ Cj 1 , c'u;^ = Cfi^* + 1 

End 

End 

With fixed subcarrier pairing t, apply water-filling on the subcarrier pairs with equivalent 

channel gains in ([5} to compute power allocation p and the weighted sum rate R as in ||6ll. 
// (R > sum_rate), sum_rate = R, t = i, p = p. End 
End (** amendment algorithm **j 

1 = 1+1 
Erui (** iteration i **) 

sum_rate, t and p are the obtained weighted sum rate, feasible subcarrier pairing and power allocation, respectively. 
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TABLE 11 

Algorithm for the individual power constrained case. 

Initialize ;= 1, fJ'^\ ^1^', (x''\ e = 0.01, amendment = false, max_it = oo, sum_rate = 
Wliile (i < max_it) f** iteration i **) 

Determine modes for all possible subcarrier pairs, and obtain ai^ „^, c^^ and 
by using pf , p-^ in (|40]|. BIT) and (|42]| 

Compute z'j^^^,\/k,m, using p'^^K p''^' , a''', at,„, c^^^, cf in (|47j 

Compute i^'' tuv'^^ Z^'^^ in ()46ll 

Compute using pf, p^^\ c^^^, cf^ ||45| 

Compete //^'+'', a('+i) A-i;', a('), r('), p('), g8} 

// (amendment = false) and i^^^^^^^f^ < and i^^^^yW^ < ^) ""'^ ( ""llal'+i"'''" < 

amendment = true 

max_it = [1.1 X ;J 
End 

If (amendment = true) (** amendment algorithm **) 

For (j = 1 to M) 
If {cj>l), .s* =argmax{j|,; _i}Z['], End 
While (Cj > 1) 
m =argmm{„,|c,„=o} «) -Km 
r* = argmax{,|,; r^s'}Zl^l' 
tf*j = 0, tf^^fji* = 1 

Cj = Cj 1 , Cfn'' = Cffi^ -\- 1 
End 
End 

With fixed subcarrier pairing t, and letting ps = p^^\ PR = p^^ , use the zero-crossing method in fT. Section 3.2] 

to update ps, PR to their optimal. 
Use Ps, PR in i32l and 113 3i to obtain mode classification and power allocation p^, Pi^, 

and compute the weighted sum rate R. 
If iR> sum_rate), sum_rate=i?, i = i, Ps = Ps' Pr = PR' ^""^ 
End (** amendment algorithm **) 

i = i+l 
End (** iteration i **) 

sum_rate, i, and are the obtained weighted sum rate, feasible subcarrier pairing and power allocation, respectively. 
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TABLE III 

Algorithm for the total power constrained case with extra direct-link transmission. 

Initialize i= 1, jU^, a^, £ = 0.01, amendment = false, max_it = oo, sum_rate = 
While (i < max_it) f** iteration i **) 

Compute (Y^^^^y'\(Y^^J^'\ \/k,m, using /j^'^ in l|59j and 

Obtain i(') using (^y*„) ' , (y^^^^ ' in (glj 

Compute yj^l, "i^m, M.™g (k^J''', (y/^J in {62) 

Compute using Kjf'jli ISJ 

Compute p^'^ using t^'\ in ^\5l} 

Compute , «('■+') using fJ-'^ a^'\ f('), pW m (|63| 

// (amendment = false) and ^ < a"*^ ( ^^"||'a'''+''|P^ ^) 

amendment = ?rae 
max_it = [1.1 X ij 

End 

If (amendment = true) (** amendment algorithm **) 

For (j = 1 to M) 
If {cj>l), =argmax{,|,;_i}y/j, End 
While {cj > 1) 
m* = argminj^i^^^o} |a) ' - a^/ 1 
r* = argmax{,|,;_i_ 

^r* j' ~ 0, tf* ijf = 1 
Cj= Cj 1 , Cm* = C/n* -\- 1 
End 
End 

With fixed subcarrier pairing i and mode selection s, apply water-filling on the direct-link, extra direct-link 
subcarriers and the relay mode subcarrier pairs with equivalent channel gains in (|5]l, to compute 
power allocation p and the weighted sum rate R. 
If {R> sum_rate), sum_rate = R, i = i, s = s, p = p. End 
End (** amendment algorithm **) 

i = i + 1 
End (** iteration i **) 

sum_rate, i, s and p are the obtained weighted sum rate, feasible subcarrier pairing, mode selection and power allocation. 
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Fig. 2. Concavity of rate versus power constraint for systems with 2 and 8 subcarriers. 




Fig. 3. Unweigiited sum rates for tiie systems with E[|/!'^'^p] = 3, E[|/i'^^|2] = 1 and E[|/!*"p] = 3. 
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Fig. 4. Weighted sum rates for tlie systems with total power constraint, and E[|ft'^''p] = 3, E[|ft'^"p] = 1, E[|/i*"p] = 3 
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Fig. 5. Unweighted sum rates for the systems with extra direct-Unk transmission, and E[|/i'^*|^] = 3, E[|/i'^^p] = 1, E[|/i^^|^] = 3. 
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Fig. 6. Unweighted sum rates for the systems with Efl/j'^^p] = 5, E[|/!'^^|2] = 1 and E[|A*^p] = 1. 
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Fig. 7. Weighted sum rates for the systems with total power constraint, and E[|/!'^^p] = 5, E[|/!'^^|2] = 1, E[|/!^^|2] = 1. 
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Fig. 8. Unweighted sum rates for the systems with extra direct-link transmission, and E[|/!'^*p] = 5, E[|/!'^^p] = 1, E[|/!^^p] = 1 
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Fig. 9. Unweighted sum rates for the systems with E[|/!'^^|2] = 1, Ed/z-^^p] = 1 and E[|/!^^|2] = 5. 
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Fig. 10. Weighted sum rates for the systems with total power constraint, and Ed/j-^^p] = 1, E[|/!™p] = 1, E[|/!*^p] = 5. 
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Fig. 11. Unweighted sum rates for the systems with extra direct-link transmission, and E[|/!'^^p] = 1, E[|/z'^^p] = 1, E[|/!^^p] =5. 
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